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Abstract ' \ 



. '• •• . ..\\ v • . '■ . . - - .. •: - * ■• • • . - ■■ : 

Yearlong case studies of 'instructional practices in elementary school* 
mathematics were analyzed to describe testing practices of teachers and the 
-Us e s made test results . The v focus of the study was on curr,iculum^embedded 
tests; the use of results of c ur r 1 c ul urn- emb edd ed tests for placement of 
students , content decisions, and student evaluation were given special atten- 
tion. Ever^ time teachers made placement decisions (e.g. f to create hbraogene- 
ous groups within classrooms) , some form of curriculum—embedded test was used 
In the making of content decisions about topic selection and pacings four dls- 

. •'• ■ : ' \ . ' • 1 , \. ' ' ' ' • ■ 

tinct styles of currfculum-embedded test usage emergedi (1) not monitoring 
student progress and, referring to tests rarely^ if ever| (2) relying on in- 
formal assessments (nojt labeled as tests) that; are conducted with the intent 
of determining adequacy of student und er s t and ing ; (3) using curriculum- 
embedded tests to make the domain for mastery clear and public but not \to 
actually monitor student progress ; ajtyi (4 \ using the results of cWrlculum- 
embedded tests as the sole criterion for deciding when a student can move on 
to- a new 'topic. ' Overall,- the data suggest that c ur r ic ul um-embedd ed ; t e s t s are 
important for, some purposes \ but *rfot for others. ^ \ 
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• • j : • - •/ 

DIFFERENCES AMONG TEACHERS IN THEIR, USE OF 
/ CURRICULUM-EMBEDDED TESTS 

■ ■■ ., . <, , . v ' • ' . ■' : , 

Therese Kuhs , Andrew Porter, Robert Floden, 
Donald Freeman, William Schmidt, and John SchwilleJ 

In spite of the continuing debates about testing, empirical studies of 



how teachers, use tests are few* Host of these investigations of te^t usage 
consider only nationally available standardised tests (e.g., Kellegan, ^Madaus , 
& Airasian, 1982)* However , the tests that are, now- most ^strongly recc . ided 
are not standardized tests, but tests that^ are clearly tied to the Sf^hw . cur- 
riculum and are an integral part of teachers* daily instruction (e.g. , Tyler & 
White, 1979; Bloom, "1981; Edmonds, 1979)* We use the term ouvTveulnm^enibeddBd 
to describe such tests. 

Perhaps curriculum-embedded tests have received so little attention in 
the research literature because they lack a clear definition. In the class- 
room it is 1 often difficult to know what constitutes attest and what does not. 
Elementary school teachers can monitor student achievement more or less eon*= 
tinuously. They observe students doing assignments and check the accuracy of 

. *r : ■ •• •• • , v ■ . 

completed assignments. . One might conclude, therefore, that most teachers are 
(testing some students most of the time. On the other , hand, a textbook exer- 
cise labeled as a chapter test might be used by the teacher as just ^another 
exercise* Which %m more the test, a a classNdiscussipn that convince-s^the 



f Theresa Kuhs, formerly a research intern with the IRT's Content 
Determinants Project, now teaches at the University of South Carolina. Andrew 
Barter, co-director of the 1RT and associate dean of MSU f s College of 
/Education, coordinates the Content Determinants Project. Robert Floden, 

Donald Freeman, William Schmidt, and John Schwille are IRT senior researchers 
and members of the Content Determinants Project. Floden is an associate pro- 
fessor of teacher education! Freeman is a. prof essor of teacher education! 
Schmidt is the chairperson of the Department of Counseling ,* Educational 
Psychology, and Special Education! and Schwille' is a, professor of teacher edu- 
cation. ' ; ,,< ■' - \\ a 




teacher that students 'are confused about vc a chapter test 

given by a substitute teacher because tb lu^ar tc -.tw .oesn't trust the 

substitute to carry on normal instruct! * 

A second possible reason^f or rese; ' r z ? 1- ^ ctention to , \ 
curriculum-embedded tests may be that e, in general^ less contro- 
versial than externally required stan * What little empirical 



research has bean done on the "uses of t , ^nerally ^feen inspired by 

criticisms of externally required tests, . * 

Whatever the reason fcfr the lack of research^on teachers 1 use of 

curriculum-embedded tests, these tests can and sometimes *do represent an im^ 

' * , .1 . ■ 

portant part off • the instructional "process and .are worthy of bur attention* - A 

* i=- ■ • . ■ • 

review of teacher uses of assessment devices (Rudman, Kelly, Wand us , Mehrens, 

Clark, & Porter, 1980) found the only testing practices consistently tied to 

positive effects on student achievement were the numerous variations on the 

mastery learning theme (i.e., objectives are specified and student progress is 

consistently monitored through tests tied to those objectives). A recent' * 

National Institute of Education (NIE) conference on testing recommended more 

research and development on how to create instructional environments in which 

"testing is merged* into the teaching process and provides timely "land rich 

feedback to students, the teacher, and other appropriate parties" (Tyler & 



White, 1979,. p. 5)* Nationally available standardized tests cannot meet this 
need. These national, tests can neither Be tied to the instruction ofan in- 
dividual teacher,, nor given with the frequency the above statement implies* 5 

^ Center for the Study of Evaluation (CSE) study of teacher testing prac- 
tices (Burry, Catterall, Chopping & Dorr-Bremme > 1982) addresses .teachers' use 
of * curriculum-embedded tests in elementary school. This survey of fourth- and 
lixth-grade teachers in a national , probability sample ,of school districts is 



it^oortant to Vhe \md ©^standing of current practices. For example, the CSE 
survey found teachers reporting an average of 23 tests per year when teaching 
elementary school mathematics (approximately ^one test every week and a half - - 
over a full school year). The, teachers reported that nearly 80% of this test- 
ing involved curriculum-embedded tests. * ; <■ 

Here, we take a much more microscopic view of curriculum-embedded tests 
and suggest, that the QB% averages conceal much variation in teacher practice.' 

- .. ; ' v • ' m, ' ,. ... ;; ..' ■*;■. .1 : ■' . .... .;; ■ ' 

Our data suggest that for some purposes , curriculum— embedded tests are impor- 
tant and for other purposes they are not. ■'■•?'' 

■ - * . -- .*'•*■' ■« 

■ - • ; . ■ ' " ■ • f : : ., ■ " \ ■ . . V" ■ r . 

Testing Within A Study of Teacher Con tent Decisions in Mathematics ' f 
Our data on te^cher^s 1 use of curriculum embedded^ tests come from an, • . v 
intensive stftidy of factors ^that may influence teachers 1 content decisions in 



from each of two schools in 



elementary— school, mathematics* We conducted, year^-long case studies -of seven 
teachers in grades 3 , A* and* 5* The" teachers came 
each of three Michigan schogj districts. 

Th<L three school districts were selected for differences in* type and 
strength of district politt^ that could influence teacher content decisions 
in mathematics. Knoxport,2 a large urban district, had a management ,^y /objee^ 
tives (MBO) program in elementary school mathematics- Sawyer, a rural small^ 

: ■ ^ "•" . ' " • / . ■ .' 

town district that had become somewhat suburban^ had just adopted a new 
distrtct^wide textbook series for use in elementary school mathematics. Finn, 
similar to Sawyer deAographieaily , had* a policy of building' autonomy for 
mathematics content decision making. Hi thin eT^ch district t we, selected for 
study two schools that differed in the extent to which teachers worked in " „ 
self-contained (and Isolated) classrooms . 



^Names of school districts, schools, and teachers have been changed. 



The seven teachers Who participated in the study were nominated by their ^ 
building principals to meet the following specifications : , (!,> teach in grades- 
3 through 5,(2) be < neither new to teaching (have at least three years of » 
experience) nor near retirement, and (3) be willing to participate In the .* ' 
study ;. '\ - : . \ . ,- "„ S " - ' 

In Sawyer, both teachers studied taught* fourth gr,ade. In Finn ^ we 
studied one third- and* one four th^rade teacher- In Knoxport, two* fourth- * ; 

■ ' ' .v* b " .: . * / v * . . . ■ • ■ . ' / 

grade teachers (in* separate bWldfngs) and one fifth^grade teacher were stud-^ 

■ - :\ . " A; - - ." . | - • .... , ; . ■ 

led* In Knox port , we selected t^o of ^ these teachers from the same ^Vehool^ to . 
get a sense- tor possible wi thin-school differences in content decision making.: 

Because thls^study- focused on the mathematics instruction of just a; few 

" = _ -' .. .'. +' ■ ■ v , :;t v "V . ■ . ■ . , * . /. t v.* 

"teachers and was not limited to a description of testing and test 4 use, it has * 

potential for making a unique ^contribution. to the understanding of ^teachers* 

uses of curriculiim—embedded tests % The c 4s e*^ study methodology provided data 

to support aS^ho rough under standi tig of each . teacher f s approach to the, teach-* . 

ing of mathematics . ^ Since the data collected did not focus on testing more 

than bn other aspects of instruction > data^coilecffpn ' procedures were unlikely - 

to lead teachfers to overreport^ test use or modify testing practice in response y 

to the study- t ^ ^ 

Teachers kept daily logs of. their instructional*" activities in mathematics 

for an entire school" year (1979-80) • In these logs' tfhey recorded the amount 

of time they spent on mathematics , their Instructional goals s and the teaching 



strategies they ^s eds( Including testing)* They alijd noted all differences 
among students in content taught and provided copies of all materials used. 
Teacher logs were collected each week and checked for completeness. 
> /. The teachers were not always clear in their log* reports as to ^bether or 
not. ; they considered an instructional activity a test* The criteria used to 

••• ■ .. 4 : ' - ' : - . 
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classify activities listed in the logs as tests were as follows i (a) the 
teacher referred to the activity as a test or (to) the activity made use of a 
.test page from published instructional materials, '". . 

, During weekly contacts, teachers were also interviewed about, their use o 
textbooks, tests , objectives, and any conversations or newly-received matVri- 

4 als related to mathematics. Lengthy interviews were conducted with the teach 

; ; . * < M ■ . . 

era at the beginning of the year to ascertain their intentions -and priorities 

and at the end of the year to probe their reactions to possible curriculum in* 

i ... ■. - f ■ * ' ■ ' 

fluences. A limited amount of classroom observation was wdone to enhance our ' 

understanding of our other data. ' 

; ; & r ■ 

WhTlre 'the focus on mathematics was useful in getting a full understandinj 
rv . * ■ ■ * . F * ■ 

- ■ . , \ • 

pf teachers' testing practices, it may be that some results will not apply to 

other subjects. For example, Yeh (1978) found that teachers test more fre- 
quently in mathematics than in reading. In her study* of elementary school 
teachers from five California districts, approximately 78% of the teachers 
reported giving math tests once a/tfonth or more, while e qnly, 68% reported test- 
ing that frequently in reading,/ Grade level is another factor that potential- 
ly limits the generalizability of this study, Yeh found that there was an 
in fuse relationship between grade level; and amount of testing. Third- and 
fourth-grade teachers were** the most frequent users of tests developed by 
^others (e.g,y textbook chapter tests) , while fifth-? .and sixth-grade teachers 
were the most likely to .develop their own tests, Still, 52% of the third- and 
fourth-grade teachers reported at least some pmmt development, 

1 v ; ■ . " 

The study is also limited by having been conducted in three districts in 

* » ■ * , 

one area of Michigan* These districts do not represent the full range of 

\£ ' • - ' « 

variation in -state and district test policies, Yeh (1978) leads us td*suspect 

an inverse relationship letween the frequency o f" use of curriculum-embidded 



tests and the amount of district or state required testing. Moreover, in the 
same study, localiy-*deveioped (particularly teaeher-made) tests were more 
often used for making instructional decisions other than placement , while 
required tests were, used for reporting to others , particularly parents* 

The state of Michigan requires testing in mathematics of all 4th', 7th, 
and 10th graders. At the time of our study, the state tests were objective- 
referenced and, for fourth-grade mathematics , contained 160 Items to assess 33 
objectives. The objectives were described by the Wate as minimal* and re^ 
suits were reported in terms of number of 4 objectives master ed^C where ^hree of 
five items correct was taken as mastery)* 

In addition to required state testing, each of the three districts had 

■* 'a 

■■ \ I " 

• ^ ' ' _ ■ _ 9 

its own testing program, In Knoxport, the Stanford Achievement jsattery* was 
administered in all elementary grades each spring. Sawyer administered the 
Metropolitan Achievement Test each spring in third and sixth grade, while Finn 
used the Stanford , to test students in grades 2 through' 5 each fall* 

_ , Types of Test Use m ■ , . ■ - ■ " 

In describing teachers 1 3 use of curriculum— embedded' tests, we have tried 

to recognise when tests are not used mb well as when they are* We have 

categorized the uses of tests as follows. 3 

I. for placement of students , including afl^EJnment to classrooms 
, ' . within a grade and assignment to instructional groups within 
1 a classroom; 

2* for^ making decisions about what content topics are to be taught, ^ 
for how long, arid to what" standards of performancei and 



^The categories are taken,, in part, from categories of test uie iden- 
tified by Tyler and White (1979)# Yeh (1978), arid, in part, our own conception 
of teacher content decisions (SchwUle, Porter, Belli* Ploden, Freeman, l 
Knappeh, Kuhs , h Schmidt, 1983) . \ 



/" *3* for student Evaluation, including grading and reporting to 
others, *• ** K . 

■ ' . ,".."'■'■* * • ■ 

■ ' - - • • s ; : ' ; * . ".- " ' ; • ' 

Knoxport Distrdct 

In Knoxpbrt , curriculum-embedded tests were very important to Andy , Lucy , 
and Teri, the threa teachers studied- To a great extent, the tests determined 
the content that students studied during the year* The Management By * 
Objectives (MBO) system was the source of much of this cur ricuium^erabedded 
testing in mathepatics.^ The MBO system provided (1) tests that could be used 
at the beginning of tjie scnool year to place students at the appropriate ob^- 
jective in' the hierarchy of objectives defined by the MBO ffrogram, (2) mastery 
test/a for each objective, (3) review tests /Covering, subsets : .of objectives , *»nd 
(4) end^of-year tests for grtfdes 4 through 6, 

PI acemen t*^p f\ students * Each of the three Knmtport* teachers used MBO 

curricuium=embedded tests to place all of their studentp in the "district 

single-strand mathematics objectives - Their placement procedures were not the 

same, however, nor were the consequences* , 

■ , * * • 

The district required teachers to, use the MO placement tesjfc to identify 

children for Title I compensatory education programs • This test consisted of 
a few items related to each obj ective over the range of grade levels in the 
MBO system. Of the two teachers we- studied' in the same building, Lucy relied 
solely on this test in placing all her students in the MBO singlets t rand ob- 
jectives and Teri , in contrast, used placement-test results to select a dif- 
ferent series of obj ectlve^referenced* pretests for each student, Teri dpi not 
complete her elaborate placement procedure until nearly the end of the first 
semester, ' s * - ' 

Since the third teacher', Andy, did not teach in a Title I eligible 
school, he was not required to use the district placement test> Instead, In 



placing his students. He relied on a district-provided locator test that 
•covered fewer objectives but included a larger number of items per -objective.' 

While the three teachers differed/ somewhat in how they made placement" 
decisions, these decisions were similar In one important respect i Students ' i 

within the same classroom differed greatly as to the objective on which they 

C ** , ' * - * ^ s * ~ _ * 

"■ * " ■ • * % ' . .. . • • /. ■ ' v - ; 

were to begin instruction* For example, one of- .Andy 1 s students was- placed *6n 

'" ■* \ • .*. ' • " ■', '• ■' . > : ' .. • : . > ■ : 

Objective 2, and, at the other extreme, two of hia'ophar students were/ placed 

on - Objective 50. - . * \ * } 

In other respects, because the teachers differed id their" instructional 

routines X, their decisions about student placement had quite different conse- 

quences. In Andy 1 s classroom, mathematics instruction was entirely self^- 

paced. ^ Once students had been placed in the HBO sequence of objectives, they 

■ • "v<__. - ■ ■. ' .. . ■ ■ 

worked independently on topics in the hierarchy until" they mastered all the 

objectives up to a point determined by Andy (this point was always beyond 

grade level). At such time, students continued to study independently, pursu-* 

^ing an enrichment prdgram that wa*s not governed by the MO system. Lucy and\ * 

Terl, on the other .hand, offered dual programs of mathematics instruction. In 

Lucy's classroom, students pursued self— paced studies based on MBO placement 

iti the mornings, ^en, later in the day, lucy provided ^whole-class ins true- 

tion \using a different textbook from "that used for work on the objectives. 

Originally , Teri, planned to do whole-class instruction three days each week i 

, * » . " ■ "•»" t * § ' . ' ' I 

and to, provide the MBO-guided, self spaced experience on the o.ther two days - 

But, in part because of her more elaborate procedures for student placement, 

• ■ • • • * " • * .. ■■ \** 

she did not begin individualised work on the MBO* objectives until the end of 

February. a » *^ **, 

* *"..."' * * ***•."*" ■ i 

In Andy f s classroom, the MBO placement wafl aiSiajor determinant of what , 

each cHild studied during the year; hence, his; use of the more thorough set . 
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of tests , for placement can be readily understood* In Lucy's classroom s the 
.OTO placement affected only a portion of. a child* 8 experience with mathe^ 
iaa t i c 8 , and '/ the mand a ted use of the less ex ac ting . MBO pi aeeraent test was ac- 
cepted* Ironically, Teri* s demand for more accurate data postponed her 
placement decisions and made- the consequences of placemen© less important . 

Selection of topics' and pacing . To the extent that' teachers stayed with* 
in the' MBO system, - topic selection was controlled by the sing le- g t r and ob j ec- 
tives , and pacing was monitored by performance on the ob j ective-ref erenced 
posttests provided by the district. As has /been noted, however, each of tine 
three teachers used additional^mater^a^s tor instruction outside the MBO sys-^ 
tern. i „ 

Andy described what he called his enrichment program as the remainder .of 
the fourth-grade textbooks Students worked on the enrictaent program after 
finishing the prescribed MBO objectives^ for their gra&e level* Once students 
completed enrichment exercises , they returned to the [MBO objectives for fur- 
ther work* Although individual -students in his class were tested more often 
than students in any of the 'other classes studied, the testing was limited to 
MBO objectives. : \ ."^V \ ■ 

' In^ Andy 1 s classroom, one-third of the students pursued the enrichment 
experiences in the text tor over; y a quarter of the school year. During that 
time, no tests were administered. Students worked independently, and the only 
teacher-student 'interactions were student initiated. Thus^Vuhless a student 
requested assistance, even informal assessments were not made. . 

During self-paced instruction in Lucy's classroom, decisions about topic 
selection and pacing were controlled by MBO objectives' and tests.' But Lucy 
*also covered an entire, separate, fourth-grade textbqok using whole^elass 
instruction, presenting each lesson in the sequence in wftch it appeared in 



the text . As did Aridjr in "his enrichment program, Lucy chose not to test 
* students on the material presented in these afternoon sessions. 
* Thus , Lucy I s students wer4l assessed only on MBO content • Further r be- 
^cause students were progressing at different speeds through the objectives, 
testing of a topic often did not coincide with the time the topic was present^ 
ed in whole-class lessons. Clearly, any instructional decisions made concern- 
ing the afternoon sessions in Lucy* s, class were based on informal assessments * 
of student^progress — if, indeed, student mastery was considered at all v Be- 
cause the entire textbook was Covered; it seems, unlikely that student mastery 

; - j * - - 'V .. . ■ , .. .. ; ! i 

of. topics was given much consideration in determining .the pace of instruction 
for Lucy* s afternoon sessions • • - ' - 

Like Lucy and' Andy $ Teri chose to limit formal assessments to HBO post- 
tests* Unlike Lucy and Andy, she did not always tie the administration of a 
posttest to completion of instruction on the tested objective, Teri seemid 
more intent on identifying posttests students could pass than on identifying 
objectives for which students needed remedial instruction. For example, she 
told her students that anyone who mastered 25 objectives would be invited to 
her home for a pizza party at the end of the year. When the end of the school 
year near ed and few students had reached this standard, Teri administered ja 
large number of objective posttests to all her students*^ ' * V , J 

The Knoxport teachers differed in how they decided when to give a student 
an objective-referenced posttest* At the beginning of the school year* tody 
left it up to the students hbotj^ecide when they were ready to take a mastery 
test. During the first week in February, Andy decided to exercise greater 
control over this decision* At first, he limited mastery testing to a 
specified day (each Friday). He explained that he was "seeing too much test- 
ing and failure and not enough willingness to do assignments." Two weeks 



later, tody" took even greater control over % when students -could, take a mastery 

, " ; v.'..- ■ ■"■ . v ..' v ■■ . ! v ;. ■ ■:. -' ;■ \ '. 

test by requiring students to solve orally one or two problems before taking a 

post test * Throughout the year g Lucy's strategy sBSLlar to the final 

strategy Andy employed* /.'* / * - ■; ■; ' = \ * 

Teri, on the other hand, * was :4ttore thorough in- monitoring students* work 

on objectives. She collected! and graded all completed objective-referenced 

assignments. Students were told at the 4 beginning of each self spaced .session . 

" wKethei" they "were to "db~another assignment or ~t§ke a mastery test on the work- 

they had done* Teri* s decision to personally monitor students* success on 

assignments created, a problem in that she, could not review the assignments 

quickly enougb for students to do self-paced work twice each week. As a re- 

Suit , only 26 lessons during the year were allocated to students* work on 

self-paced assignments , half of which occurred during the last month of . 

school. For reasons described earlier f much of this time was spent taking 

tests, * In fact, over these 26 lessons an average%f 29 testes per student were 

taken, * " • ' • . * . ■• : " ■ : ' - ~. 

Student evaluation and reporting to others . For the Knoxport teachers, 
the MBO tests were an important means of communicating what children had 
learned to per sems outside the Qlassroom, Student records of progress in the 
MBG system were reviewed by monitors sent .out, by the district's central a.^min^ 
istration and by the teachers the students would have next year. Further , all 
three* teachers we- studied reported on children's MBO profiles during parent 
conferences- . **- V . ' . 

Sawyer District ... 

In Sawyers curriculum-embedded tests were not very important to the 
teachers studied (Jacqueline and Wilma) , at least in comparison with their 



importance to the three Knoxport teachers. The teachers 1 o^Wconyictions, the 

. ^. u ..... _ ^ . . - ' ■ i; •-' - • , > 

practices of other .teachers s and, . for one teacher , the new district textbook 
were much more important in influencing student opportunities to learn* . # 

Placement of students , in Madly on Schools students were taught by teams 
of teachers. Jacqueline's fpurth/fifth— grade team used ability grouping in 
mathematics, language arts, reading, and, to a limited extent, in science and 
social studies* Team members spent the first ftwb weeks of school, teaching 
mathematics to homeroom students, reviewing skills they believed to have been 
covered in the preceding year* * ^ - 

Decisions about placing students in Instructional groups were made by 
team Members after these two j#eeks of review* The team administered a place- 
ment test ^they had developed five years earlier that cohered . a wide range of- 

topics, including place value; -concepts and. skills in addition, subtraction, 

- - - - ■ ■ . '■ • : . • - . 

and multiplication; story* problem solving; and some division problems* * . 

Jacqueline was assigned students who had done well on the placement/ test . 
For the mosf part, the membership of her mathematics group, remained stable 
throughout the year. 1 The primary exception" to stable group membership was 
also an exception to the use of placement test results as the main criterion 
in creating homogeneous groups * Initially, one emotionally impalrea and. math- 
ematically weak student was assigned to Jacqueline's mathematics class because 
she had earlier established good "rapport w^th this student. Later In "the 
year, two low-ability students were transferred to Jacqueline td^ make up a v ~re- 

medial grdup of three* During the entire year, "only one student was moved 

r ■ - „. - . . . • " ' ■ 

from another group to Jacqueline's hi^h-abllity mathematics group because of 

■ .. =: * _ ...... i~~J$^'- ' * : ' " ' ' = =■.*,.* 

better performance than placement test results had predicted. No students 

were moved out of her class to the lower ability group taught by one of the 

other members of the team. o. * <±\ ' 



For all but one month of the year, Jacqueline taught^ mathematics to two 
- groups, one high^abiiity and one remedial- During that month, the high- * 
ability students were Split into twrf groups based on written assignments and "\ 
Jacqueline's impression. of which students were ready to go on. to sturdy long 
* division and which needed further work *on multiplication. ^ Eventually , the two ^ 
groups were brought back together because* Jacqueline found It difficult to 
provide direet instruction to two high-ability gipups plus the low-ab^lty 
students- ^ . • "* . \ . 

Wilma , the other Sawyer teacher stud ied , was not? a member of a teaching 
team* and used only whole^class instruction, Thus , f or, Wilma, there were no 
placement decisions tojnake, with or without the aid .of. curriculum-embedded 
tesps, However, placement decisions made in fifth gr^ae, the next higher . 
grade, did hay.e v some influence on her mathematics instruction. At the begin- 
ning of fifth grade, students were given a placement tijst, written by fifth- 
and sixth-grade teachers in the building and used to homogeneously group, stu- 
> dents for the next two years of their mathematics instruction. Wilma * 
deliberately emphasised content that she knew was on that placement test, 



explaining » "after all, when these children finish and go on to fifth grade 
next year, the first thing they are going to get hit with in the fall if a, 
battery of tests measuring their skills addition, subtraction, multiplica- 
tion, anfl division," v 1 

- Selection of topics and pacing, To determine whether or not teachers 
used tests to make pacing, decisions , the teachers' log entries for -days fol^ 
lowing activities we classified as tests were analyzed- Fo r examp 7 add i- 
tional instruction on content just tested was taken as evidence that test 
performance led to remediation experiences. 
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'■■.'■.'-'>: . ■ ' y- : " . >•• ' ■ "'' : -l ' Sv ' r 

' . : • 1 .'. '" ..... 

€ * ■ ; . '- : Not . counting 1 tests of ba^lc factss Jacqueline 'record^d/formal testing of 
,her class 15 times during the year , or about once every wo 'and a half weeks - ~ 
Her log entries, however f provided little" evidence of/topic selection or pac-^: 
ing being based on those tests. Typically, on the day following a test, items 
were placed % on "the chalkboard as^- feedback' to BLudentB, then new content" was * 
introduced* *Qnly once was *there substantial remediation af ter (as opposed ^to 
before) testing. On that occasion, Jacqueline- focused -review on a particular 
group of test problems • 1 Students who had scored lower 'than B oh the test were 
required to individually review incorrectly answered problems with the .teach~ 

^ er. •' '" . " ■ - , ' , "-" ' ' = "I;. V 

For Jacqueline, chapter tests appeared to be an of ficlal certification df 

.mastery (i.e. , confirmation of what rr she lalready knew about student per for— , 
mance) rather ^ than a diagnostic inquiry that might direbt^lnstruction. One 

' evidence of* this was Jacqueline's practice of occasionally postponing ojr even 
omitting testing for students she considered 1 insufficiently prepared. For 
example, once she tutored three students while the class was taking a test. 
The three students then, took the test * later in the day- On another exception^ 
al occasloni when her remedial group had' been having great difficulty with the 
measurement chapter in the textbook, Jacqueline decided not to test at all, 
saying that the chapter was too hard for? the students and for once she would 
have to be content fwith exposure rather than mastery s - •; . 

In deciding wnether students *were ready to take tests and begin new 
topics, Jacqueline used day^to^-day written assignments as well" as students* 
/ general performance in class. In several ways these assignments were, for 
Jacqueline, formal assessments. She kept careful records, and students had to 
complete assignments, whether they were absent or pimply slow. Student per- 
formance on these assignments .influenced pacing. Ordinarily, in the year of 
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study, the high-ability students in Jacqueline's class moved along at a 
pacSv she considered satisfactory.;. Students who had occasional difficulty 
; received remediation; . students who| finished assigpme^ft early^ loften * did extra*- 
; credit assignments. Jacqueline routinely recorded /percent grades on assign- 
ments for communication to students and use in computing report-card averages. 
' - -^J^m X^P^^^^ ^ these written assignments illustrated in t " 
Jacqueline's decision to split her high-ability group ^or about a month in ; the 
late winter , letting some students go on to" a r new 1 to^cy while keeping others 
ori two-digit multiplication. J For some time she (had been ^uncertain about iow 

"~ r --y ^"y y ^ y ' ; ';" ." '.• . : ■ ""S r "y . ; v ' r .,VN?:y" iyy .y ..; " :yV t-^; 

to deal with the War led progress of her high-ability : s tudejits on two-digit 
multiplication, the first snag she reported having with^t his group all year. 
At first: she hoped to avoid splitting the group by continuing- to work indi- 
vidually with the students who were having di f £ icul ty . ^ She .thought a particu- 
^lar ditto (Holt ditto 183} would make, clfear whether ^Ts^udents were ready , to go 
^ohl- :But the decisibn was delayed still further; so many children were out * 
sick due to a f lu^epidemic that Jacqueline said she could not tell how they 
were doing until she got their make-up papers, , Yet the formal chapter test 
did not Influence her decision; Jacqueline did not administer the multiplr^a- 

tion chapter test to any of these students untiia.few days after she had 

I. ■ . * . ' ' ' \ ■ ' ■ . • '' 

s^lit her high-ability group* , * - \ - ■ ' 

Unlike Jacqueline, Wilma did not follow the textbook closely. For 

example, Wilma began the year by covering the /first 10 pages of the textbook, ^ 

but then skipped to page 59. From there, she skipped to page 50, then to page 

13, next to page 57, then Sack to page 20, and so on through . the year, Given 

= \ ' t = ■ *. • ^ . t 1 - . * '■■ ■ - 

this style of textbook usage, it was -not surprising that she made no use of 
textbook chapter tests* - " * 



Wiiraa did report H situations in her log that were classified as tests • 
Consistent with her emphasis on computational' skills »; six of the 11 , 



curriculum^embadded tests were on ha^c number facts (e*g*, multiplication 

-tables). '=>,--- . . ; . • 

■ .. \ .-■ ' V ■ v "hi ="^r . ••■ ■ .. 

f ^ Other lessons in .Wilma* a classroom, Jhowever t did seem to represent 

occasions where student progress was assessed- "This inference is based, on 

annotations Wilms made in her log entries about activities she did not^desig7 

Rate as Involving a tests The following ar# examples of this : j 

January 30 . Assignment to see how well students could divide . * . .' . . 



without help. v 



April 7 ^ Review—" to see what students remembered from 

before vacations" % 

May 1 3 Started lesson with independent assignment ? "to 

^ see how much students remembered , " '-_ \ 

May 22 Assignment M to see how- much students remembered ' 

- [■■■ ; • ' . - . from concepts taught throughout the year • " '■ ft% 

Nevertheless, Wilma ? s testing practices and iess^formal" assessments of 
student progress did, not change the pacing decisions she made at the beginning 

of the year.* In an early interview she listed mathematics ^content areas ^and 

• .. .. . ._ % . =___. _ __ ■_. .... _ . _ : _ : •' ... _. ... 

the dates she ■. projected for beginning instruction* She called this her "sub-. 

ject clock," and it was based on five years of teaching experience , four of 

which were at her current school* Table 1 Reports Wilma 1 s^projected parting 

dates and actual starting dates taken from the .logs* In all cases p the 

projected and actual dates were within one to tfro weeks of each other* 
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Accuracy Wilms 1 s Subject Clock 



Content ^rsa; A 



Addition 

Subtraction • * 

Mul tip! ieation 

Division ^ 

Fractions & Other- 
Peripheral Topics 



Place Value) 



Projected Starting Pate 

August 28 \. 
*, October 15 

i . • , r t ^ 

November 15 
sr - ~ ' February 1 

- April 7 -I % ' - 



Actual Date 
from Log 

August 28 

i 

October 9 
November 5 
January 21 
April 16 



Student evaluation and reporting to others . 
Sawyer , only Jacqueline made explicit use of curriculum-embedded ^ tests for 



Of the two teachers in, 

* • ! 



grading and to cpmmunicate learning outcomes in mathematics to -pets 6ns outside 
the classroom* Xft October Jacqueline explained \to her students; ;that 25% of 
their report— card grade for mathematics would-be; based on test performance » 
with the other 15% being-based oh average ^percent grades in daily assignments, 
Jacqueline's clearest use of attest to communicate student performance to 
parents occurred in a B unit on telling time. When another member of the team : 
identified telling time as an^a^ea of difficulty for students, , the team j'olnt- 
. planned four sessions of supplementary math instruction in which the. 

teachers 1 aides supervised instruction on telling time, 'Parents of students * 
who did poorly on a , post test rfeveloped^ ^Y Y^ another member of the team were 
sent letters indicating their <^hidd f ^ continuing deficiency and asking that 
they give their child practice at home on telling time. , 



* Finn District ■ \ , . = }■"-.•■ 

, In Finn, ^s in Sawyer > the teachers (Donna and Bob) made limited use of 
curricuium^emb<added tests, Donna, who taught in "an open^space school 9 used a 
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test to place students ^n^small* ability groups* but admiulsterea tests 

infrequently after the groups were formed. Bob, who taught a self-contained 

■ «-"••' — * " " •' --' .- . : -a - 1 

f ' ' " - ' \ -' - - - ■ ■ . * . . * 

classroom in Roosevelt School, made some use of tests for pacing but - ques- 
tioned the validity of certain chapter tests - that he might have used . - 

" ; ; - ^ ' _ ! • ,, v. N" • * ( ' v \. v 

' - ^Placement of students , N© attempt- was 'made In either of the, two schools 
to create homogeneous ability 'groups through initial^assignment of students to 
classrooms • Student placement was never an Issue in Bob f s class because * for 
the most^part, he taught new topics, to the. whole class. ^ \ ' 

Donna used whole-class instruction until November, when she divided her 
class into -two instructional groups* Prior to the formation of these groups, 
students completed four timed testis, on multiplication tables and two other 
tests covering a, broader range of content. Donha also had the results irom 
tfie f all Stanford Achievement testing* Nevertheless , she assigned students to 
Instructional, groups for mathematics based solely on a timed test of 100 
multiplication factrf (i.e. , a test of 100 items of the type 8, x 9 - ?). , 



Student performance on this currieuium^embedded test had substantial 

consequences* The higher ability group^pe^^ved instruction on topics that 

were not presented to the lower ability group (e.g., fractions, two-digit 

- if ; . = • " " - k V* - 

4 ~ 1 ' * 

division, and problems with three-digit multipliers) - Further , Donna* s * . 
comments reflected that she had, different expectations for the lower ability * 
group. For example, she explained that students in the lower ability .group . . 
could not think about too many things at any one time. 

Selection of topics LJ^gjj^ pacing * / Both Bob arfd Donna focused the^ir testing 
on the content they viewed as most important for students to master (e.g. , 
basic number facts) . Nine of the 12 basic^number-fact tests Bob used were, 
mastery quizzes from individualized packets of materials he had assembled. 
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Further, both teachers required emch student to demonstrate -mastery of multi- 
plication tables^ In ( an oral quiz. -Interviews with these teachers revealed 
that they viewed memorization of basic facts as one of the more important out- 
comes of mathematics studies. - ~ 

/ pob's interpretation of several test results further demonstrated that 
>his view of the relative importance of topics affected the way ,he used test 
result a..: Because he_ followed the scope and sequence of the textbook closely, 



it was convenient for him \ to use the chapter tests -from the- book for assess- 
ment • However , Bob , frequently discounted the importance of student errors on 
these tests* For example, a chapter test on subtraction included problems 
that we £# written as missing*-addend problems. While he had gone over the 
textbook*! treatment of the topic^ Bob explained he was not interested in 
knowing; whether or not students could do missing— addend problema. In another 

i - S 

instance. Bob said his students "really bombed" on a story-problem chapter 
test* Bob reported that the students had difficulty because the test used the 
~ phrase "how many does this make"* instead of "how many in all." Since Bob felt 
it pe^dagogicaily important not to confuse students with several different 
problem formats, he viewed students 1 difficulty as a problem with the book 
rather than as student failure* . 

In addition to the focus on computation, which she viewed as important. 
Donna's log entries revealed a particular pattern of testing. Before instruct 
tional groups were formed in November , Donna reported nine test events (six of 
these were tests of basic multiplication facts) . After the class was divided 
Into groups, testing was less frequent (i.e., only three tests in the high- ^ 
ability group and two in the low-ability group). Further, two of the three 
tests administered to the high-ability group were given on sequential days 
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when a substitute teacher was in the class * Instruction on days following 

tests did not deal extensively with the content tested* 

Despite her limited use of tests f Donna explained that she knew when 

students .were ready to go on to ^Something new by observing the accuracy of 

their work and by their ability to switch from one problem type to another 

without errors. In her log $ Donna reported many assessments that were not 

labeled as tests* Below are some exampiesi 

, December 6 "Students are having a rough time doing. the shorter 

"method. ,8 • • - ' • ^ *\ - \ 

January 4 "Group is still forgetting how to work multiplication - 

; *" problems ex* 36 x 48 and need more work at this." 

January 22 "The class caught on to division very quickly. It only 
seemed necessary to do one type of problem on page 194." 

February 12 "The group is still working on math packet mainly be— 
v cause of errors students are makingMn multiplying. \" 

They can do the process*" * . . 

- Student evaluation and reporting^ to others . The only evidence that Donna 
or Bob used test results' to report student progress to others occurred early 
in the school year* In October % Donna commented that the teacher^made test 
she gave "was mainly for report card evaluation," 

Since Bob frequently discounted the results of the tests he gave, he did 
not consider a total test score. valid for use in, reporting to others* He did, 
however, discuss students* performance on the Stanford Battery with parents^ 
during parent conferences , and he indicated that the parents showed consider— 
able interest in those results. 

Summary and Discussion of Findings 
In every case where placement decisions were made by teachers in the 
present study, either to homogeneously group students into classrooms or to 
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fom homogeneous, groups . within fora of cutricu^um**ambeddad 

test was used. Of the three a priori categories- of- test usage that formed , the 
framework for analysis , x . student placement provided the most frequent examples 
of :teacher reliance on curriculum-embedded test results. "V ■ * • ^ 

/ The three Knoxport 1 teachers used district MBQ tests to determine stu- 
dents' starting place for self-paced studies. Teri used. an even more elabo- 
rate system of * testing than the MBO system prescribed, and Andy chose to use 

the -ffloti detailed: of " the two available placement tests. The teachers who 

* ; _ ! ■ ■ , ' ■ ' =. ' . _ * ■. : ,;' ' ' ' 

chose to identify homogeneous instructional groups (Jacqueline, who grouped 

■ *. * ■* , . 

students across the teamed classrooms , and Donna, -who subdivided her class in- 
to two .groups) based student group membership on the results of a formal test. 
However, both teachers conducted inetruction , without grouping students for a 
period of time before administering the placement test j allowing informal 
observation to influence their placement decisions. Although the other two 
teachers did not make placement decisions , one of them was in a scnool whei^e 
teachers, at the next higher grade relied on a locally developed test for - s 
grouping students in mathematics. 



When making decisions about what topics to teach and how quickly to move 

■ . V 1 , 

from one topic to the next, four distinct styles of test usage emerged* 
Teachers displaying the first style did not monitor student progress and 
rarely, if ever, referred to tests. Each of the three Knoxport teachers, 
Lucy* Andy, and Teri, used this style when providing instruction not Included 
in the district's MBQ system. Teachers displaying the second style relied 
primarily on informal assessment, which they did not label as testing, to 
determine adequacy of student .understanding* Wilma, Bonna*, and Bob were most 
clearly identifiable with this style* The teacher displaying, the third 
style used tests to make the domain for mastery both clear and public, 



/but test results were redundant -as far as making decisions about -pacing " : * 
and -atopic selection were concerned. For example, Jacqueline used chapter \ 
'casts to mark the end of instruction on a topic and to conflrin her own day** 
by-day assessment of student progress • .We observed the-, fourth style of" test 
usage in the self-paced MBO components of Andy and, Lucy' s mathematics instruc- 
tion. There, tests were the sole criterion for determining when a student 
could move on to the next instructional objective. " 

Because -the amount of," curriculum— embedded testing varied considerably 
among the seven teachers, it is possible to speculate on the reasons teachers 
do or do not use tests* One plausible and parsimonious explanation is conve- , -i 
nienee* ! yhe Knoxport teachers were expected to provide instruction. on the MBO 
mathematics objectives and were 'provided* with copies of tests to use that* ware 
tied to those objectives. These teachers 'ditf more testing than did/the other 
teachers, even when .the testing was not required, ' . 

Nevertheless * these same teachers di'd almost no testing when * providing 
instruction outside the HBO system where tests were; less convenient* In 

addition, teachers wt&o followed a textbook closely were more inclined to use 

_ m • ■ . " . / • " . - 

chapter tests than, were other teachers. .- ■ - '- 

■ *.»_*" ** 1 r _ * 

■ - * "\ ' * ' 

There were* however, a few instances in which teachers constructed their 
own tests to serve a partietd^arrunetion, As already noted, placing students, 
in homogeneous groups for Instruction was one occasion. Teachers also saw 
timed tests as serving a useful function in getting students' to memorize 
basic-akill facts. Finally, there was some evidence that the teachers were 
more inclined to construct tests for use when making decisions about students- 
that affect other teachers (i.e., shared decision making) as in the case of 
Jacqueline's team. ^ 



. There was little evidence in our ease studies that the teachers used > 
curriculum- embedded testa to make grading decisions. The G3E survey p however* 
found 95% of the elementary teachers reporting that they used self— made tests 
and 77% reporting they \ised chapter tests for purposes of' determining student 
grades in mathematics (Surry et al., 1982) « incept for Jacqueline, the teach- 
ers we studied appeared to make grading decisions, based on general perceptions 
of daily student performances Tests are a part of daily performance, and per^ 
haps this overlap explains the high rate of test usage *in the CSE results. 
Nevertheless, the results of our case studies call into question the notion 
that testa play an important role in determining grades in mathematics. 

In contrast to curriculum— embedded tests, externally-required tests given 
once a year or less were rarely used by teachers for any of the three cate^ 
gorles of test use considered (Schmidt, 1981). The only ^placement decisions 

based on externally-required tests were for Identifying students to par tici^ 

• ■ . ■ . 

pate in pull-out programs (e*g. % gifted and compensatory education} , and these 
decisions were not the teachers' to make. As for pacing, the frequency of the 
externally— required t as ts was -insufficient for this purpose, arid the teachers 1 
topic selection did not seem to be much influenced by the state minimum eonr= 
petencies test or^the^district-UBed standardised tests, (Schmidt , 1981 ) * Tn . 
fact , surprisingly few teachers had a thorough knowledge of the mathematics 
content *on standardised tests that had been given for some time in their dis^ 
trict* Some of the teachers did provide feedback to parents^bn student per- 
formance on state and standardised tests* However, because the tests were ndt 
viewed as tted to instruction^test performance waSf ndt taken as an indicator 
of performance in a particular teacher's class* 



Conclusion 

% While the "case studies revealed relatively little testing and use of test 
results when making instructional decisions among teachers , neither was there 
evidence that teachers would resist attempts to increase the amount and use of 
testing in their classrooms. Over 70% of the teachers in the CSE survey 
reported beliefs that testing motivates students to study harder and that more 
competency, proficiency, and functional— literacy testing should be done (Burr y 
et al., 1982). m . ^ 

Those who wish to increase the frequency of formal testing in classrooms 
should consider the 8ltuations-tha£ r .-iu this study, encouraged teachers to 
test* The use of an objectives-referenced instructional management system re- 
sulted in more frequent testing. However* the Knoxport MBG system did not 
cover- the entfte- range of content taught, and content outside the HBO system 
was not tested** Further, because a self-paced instructional strategy was 

used, some students were tested less often than students in classrooms in 

■ ** - . . * - ■ ■ ■■ 

other districts (e.g. , one student- only completed three posttests during the 

year).. Teachers who followed a textbook cloaely also "tested regularly, but 

they did not use test results to a great extend to inform their instructional 

decisions , If classroom testing practices are to be changed , teachers must be 

made more aware of the benefits and pitfalls of frequent testing and learn, 

more about how to design and use a testing 'program in their classrooms, 
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